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Abstract 

We present a general method of designing fast approximation algorithms for cut-based min- 
imization problems in undirected graphs. In particular, we develop a technique that given 
any such problem that can be approximated quickly on trees, allows approximating it almost 
as quickly on general graphs while only losing a poly-logarithmic factor in the approximation 
guarantee. 

To illustrate the applicability of our paradigm, we focus our attention on the undirected 
sparsest cut problem with general demands and the balanced separator problem. By a simple 
use of our framework, we obtain poly-logarithmic approximation algorithms for these problems 
that run in time close to linear. 

The main tool behind our result is an efficient procedure that decomposes general graphs 
into simpler ones while approximately preserving the cut-flow structure. This decomposition is 
inspired by the cut-based graph decomposition of Racke that was developed in the context of 
oblivious routing schemes, as well as, by the construction of the ultrasparsifiers due to Spielman 
and Teng that was employed to preconditioning symmetric diagonally-dominant matrices. 

1 Introduction 

Cut-based graph problems are ubiquitous in optimization. They have been extensively studied - 
both from theoretical and applied perspective - in the context of flow problems, theory of Markov 
chains, geometric embeddings, clustering, community detection, VLSI layout design, and as a basic 
primitive for divide-and-conquer approaches (cf. [38]). For the sake of concreteness, in this paper 
we define an optimization problem V to be an (undirected) cut-based (minimization) problem if 
its every instance P € V can be cast as a task of finding - for a given input undirected graph 
G = (V, E, u) - a cut C* such that: 



C* = argmin ^ CcV «(C)/ P (C), (1) 

where u(C) is the capacity of the cut C in G and fp is a non-negative function that depends 
on P, but not on the graph G. It is not hard to see that a large number of graph problems fits this 
definition. For illustrative purposes, in this paper we focus on two of them that are fundamental 
in graph partitioning: the generalized sparsest cut problem and the balanced separator problem. 
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In the generalized sparsest cut problem, we are given a graph G = (V, E, u) and a demand graph 
D = (V,E[),d). Our task is to find a cut C* that minimizes the (generalized) sparsity u(C)/d(C) 
among all the cuts C of G. Here, d(C) is the total demand YleeE D (c) ^( e ) °^ ^ ne demand edges 
Ed(C) of D cut by the cut C. Casted in our terminology, the problem is to find a cut C* being 
argmin c «(C)/ P (C) with f P (C) := l/d(C). 

An important problem that is related to the generalized sparsest cut problem is the spars- 
est cut problem. In this problem one aims at finding a cut C* that minimizes the sparsity 
u(C)/min{|C|, \C\} among all the cuts C of G. Note that if we considered an instance of gen- 
eralized sparsest cut problem corresponding to the demand graph D being complete and each 
demand being 1/\V\ (this special case is sometimes called the uniform sparsest cut problem) then 
the corresponding generalized sparsity of every cut C is within factor of two of its sparsity. There- 
fore, up to this constant factor, one can view the sparsest cut problem as a special case of the 
generalized sparsest cut problem. 

The balanced separator problem corresponds to a task of finding a cut that minimizes the sparsity 
among all the c-balanced cuts C in G, i.e. among all the cuts C with min{|C|, \C\} > c\V\, for some 
specified constant c > called the balance constant of the instance. In our framework, the function 
fp(C) corresponding to this problem is equal to 1/ min{|C|, \C\} if min{|C|, |C|} > c\V\ and equal 
to +00 otherwise. 

1.1 Previous Work on Graph Partitioning 

The captured by the above-mentioned problems task of partitioning a graph into relatively large 
pieces while minimizing the number of edges cut is a fundamental combinatorial problem (see [38J) 
with a long history of theoretical and applied investigation. Since most of the graph partitioning 
problems - in particular, the ones we consider - are NP-hard, one needs to settle for approximation 
algorithms. In case of the sparsest cut problem and the balanced separator problem, there were 
two main types of approaches to approximating them. 

First type of approach was based on spectral methods. Inspired by Cheeger's inequality |17j 
discovered in the context of Riemannian manifolds, Alon and Milman [3] transplanted it to discrete 
setting and presented an algorithm for the sparsest cut problem that can be implemented to run 
in nearly-linear tim^], but its approximation ratio depends on the conductanc^l < &(G) of the graph 
and can be as large as fi(rt) in the worst case. Later, Spielman and Teng |4CH I42j (see also [5] 
and [6] for a follow-up work) used this approach to design an algorithm for the balanced separator 
problem. However, both the running time and the approximation guarantee still depend on the 
conductance of the graph, leading to an Q(n) approximation ratio and large running time in the 
worst case. 

The second type of approach relies on flow methods. It was started by Leighton and Rao |32j 

1 The key algorithmic step of [4] is computation of a vector that is orthogonal to all-ones vector and (up to 
sufficient precision) minimizes the energy of the Laplacian of the graph. The canonical approach to this task leads 
to an algorithm running in 0(m/$ 2 ) time. To get no dependence on $ in the running time, one notes that finding 
a vector that, say, 1/2-approximately maximizes the energy of the pseudo-inverse of the Laplacian is sufficient for 
[4] to work. One can compute such a vector in 0(m) time by employing the algorithm of [31] and avoiding explicit 
computation of the pseudo-inverse via appropriate use of the nearly-linear Laplacian system solver of Spielman and 
Teng [40]. 

2 Conductance <&(G) of a graph G — (V,E,u) is defined as 3>(G) := mmQ^ccv ttt^tttwt! where U(C) := 
roles vec u (i v > w )) i s the total capacity-weighted degree of the vertices in C. 
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who used linear programming relaxation of the maximum concurrent flow problem to establish first 
poly-logarithmic approximation algorithms for many graph partitioning problems. In particular, 
an 0(log n)-approximation for the sparsest cut and the balanced separator problems. Both these 
algorithms can be implemented to run in 0(n 2 ) time. 

These spectral and flow-based approaches were combined in a seminal result of Arora, Rao, and 
Vazirani [10] to give an (9(y / Iogn)-approximation for the sparsest cut and the balanced separator 
problems. 

In case of the generalized sparsest cut problem, the results of Linial, London, and Rabinovich 
[33], and of Aumann and Rabani [11] give a 0(log r)-approximation algorithms, where r is the num- 
ber of vertices of the demand graph that are endpoints of some demand edge. Subsequently, Chawla, 
Gupta, and Racke [16] extended the techniques from [10] to obtain an 0(log 3 / 4 r)-approximation. 
This approximation ratio was later improved by Arora, Lee, and Naor [9] to O ( \f\og r log log r ) . 

1.2 Fast Approximation Algorithms for Graph Partitioning Problems 

More recently, a lot of effort was put into designing algorithms for the graph partitioning problems 
that are very efficient while still having poly-logarithmic approximation guarantee. Arora, Hazan, 
and Kale [7J combined the concept of expander flows that was introduced in [10] together with 
multicommodity flow computations to obtain 0(\/\og n)-approximation algorithms for the sparsest 
cut and the balanced separator problems that run in 0(n 2 ) time. 

Subsequently, Khandekar, Rao, and Vazirani [30j designed a primal-dual framework for graph 
partitioning algorithms and used it to achieve 0(log 2 n)-approximations for the same two problems 
running in 0(m + ra 3 / 2 ) time needed to perform graph sparsification of Benczur and Karger |14j 
followed by a poly-logarithmic number of maximum single-commodity flow computations. In [8], 
Arora and Kale introduced a general approach to approximately solving semi-definite programs 
which, in particular, led to 0(log n)-approximation algorithms for the sparsest cut and the balanced 
separator problems that also run in 0(m + n 3//2 ) time. Later, Orecchia, Schulman, Vazirani, and 
Vishnoi [M] obtained the same approximation ratio and running time as [8] by extending the 
framework from [30J. Recently, Sherman [37] presented an algorithm that for any e > works 
in 0(m + n 3 / 2+e ) time - corresponding to performing sparsification and 0(n £ ) maximum single- 
commodity flow computations - and achieves an approximation ratio of 0(y/\ogn/e). Therefore, 
for any fixed e > 0, his algorithm has the best-known approximation guarantee while still having 
running time close to the time needed for maximum single-commodity flow computation. 

Unfortunately, despite this substantial progress in designing efficient poly-logarithmic approx- 
imation algorithms for the sparsest cut problem, if one is interested in the generalized sparsest 
cut problem then a folklore 0(log r)-approximation algorithm running in 0(n 2 log U) time is still 
the fastest one known. This folklore result is obtained by combining an efficient implementation 
[144 124"] of the algorithm of Leighton and Rao [32] together with the results of Linial, London, and 
Rabinovich [33] , and of Aumann and Rabani [11] . 

1.3 Our Contribution 

In this paper we present a general method of designing fast approximation algorithms for undirected 
cut-based minimization problems]! In particular, we design a procedure that given an integer 

3 In fact, as we briefly mention in section POl our framework can be applied to even more general class of problems: 
the multicut-based minimization problems. 
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(Uniform) sparsest cut and balanced separator problem: 
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Figure 1: Here, n denotes the number of vertices of the input graph G, m the number of its edges, 
$ its conductance, U is its capacity ratio, and r is the number of vertices of the demand graph 
D = (V,Ef),d) that are endpoints of some demand edges. Also, O(-) notation suppresses poly- 
logarithmic factors. The algorithm of Alon and Milman applies only to the sparsest cut problem. 



parameter k > 1 and any undirected graph G with m edges, n vertices, and having integral edge 
capacities in the range [1, . . . , U], produces in <D(m + 2 k n l+l ^ 2 ~^ log U) time a small number of 
trees {Tj}j. These trees have a property that, with high probability, for any a > 1, we can find an 
(alog^ 1+ °^^ fc n)-approximation to given instance of any cut-based minimization problem on G by 
just obtaining some a-optimal solution for each T{ and choosing the one among them that leads 
to the smallest objective value in G (see Theorem 14.11 for more details). As a consequence, we 
are able to transform any a-approximation algorithm for a cut-based problem that works only on 
tree instances, to an (a log^ 1+0 ^^ fc n)-approximation algorithm for general graphs, while paying a 
computational overhead of 0(m + 2 k n 1+l ^ 2k ~ 1 ^ logC/) time that, as k grows, quickly becomes close 
to linear. 

We illustrate the applicability of our paradigm on two fundamental graph partitioning problems: 
the undirected (generalized) sparsest cut and the balanced separator problems. By a simple use 
of our framework we obtain, for any integral k > 1, a (log ( - 1+0 ^ 1 ^ fe n)-approximation algorithm 
for the generalized sparsest cut problem that runs in time 0(m + \Ed\ + 2 k n 1+1 ^ 2k ~ 1 ^ log [/), 
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where \Ed\ is the number of the demand edges in the demand graph. Furthermore, in case of the 
sparsest cut and the balanced separator problems, we combine our techniques with the algorithm 
of Sherman [37] to obtain approximation algorithms that have even better dependence on k in the 
running time. Namely, for any k > 1 and e > 0, we present (log^ 1+ °^^ k+1 ^ 2 ^ n/y / e)-approximation 
algorithm^] for these problems that run in time 0(m + 2 fc n 1+1// ( 3 ' 2fc_1 ) +£ ). We summarize our 
results together with the previous work in Figure [TJ One can see that even for small values of 
k, the running times of our algorithms beat the multicommodity flow barrier of Q(n 2 ) and the 
bound of Q(m + n 3 / 2 ) time corresponding to performing sparsification and single-commodity flow 
computation. Unfortunately, in the case of the generalized sparsest cut problem our approximation 
guarantee has poly-logarithmic dependence on n. This is in contrast to the previous algorithms for 
this problem that have their approximation guarantee depending on r. 

1.4 Overview of Our Techniques 

Our approach is inspired by the cut-based graph decomposition of Racke [36] that was developed 
in the context of oblivious routing schemes (see [351 [T2| [T5l [28] for some previous work on this 
subject). This decomposition is a powerful tool in designing approximation algorithms for various 
undirected cut-based problem. It is based on finding for a given graph G with n vertices and m 
edges, a convex combination of decomposition trees {T{\i such that: G is embeddable into 

each Tj, and this convex combination can be embedded into G with O(logn) congestion. One 
employs this decomposition by first approximating the desired cut-based problem on each tree 
- this usually yields much better approximation ratio than general instances - and then extracting 
from obtained solutions a solution for the graph G while incurring only additional O(logn) factor 
in the approximation guarantee. 

The key question motivating our results is: can the above paradigm be applied to obtain very 
efficient approximation algorithms for cut-based graph problems? After all, one could envision 
a generic approach to designing fast approximation algorithms for such problems in which one 
decomposes first an input graph G into a convex combination of structurally simple graphs Gj 
(e.g. trees), solves the problem quickly on each of these easy instances and then combines these 
solutions to obtain a solution for the graph G, while losing some (e.g. poly-logarithmic) factor in 
approximation guarantee as a price of this speed-up. Clearly, the viability of such scenario depends 
critically on how fast a suitable decomposition can be computed and how 'easy' are the graphs GjS 
from the point of view of the problems we want to solve. 

We start investigation of this approach by noticing that if one is willing to settle for approxi- 
mation algorithms that are of Monte Carlo-type then, given a decomposition of the graph G into 
a convex combination {(Aj,Gj)}j, one does not need to compute the solution for each of GjS to 
get the solution for G. It is sufficient to just solve the problem on a small number of GjS that are 
sampled from the distribution described by AjS. 

However, even after making this observation, one still needs to compute the decomposition 
of the graph to be able to sample from it and, unfortunately, Racke's decomposition - that was 
aimed at obtaining algorithms that are just polynomial-time - has serious limitations when one 
is interested in time-efficiency. In particular, the running time of his decomposition procedure is 

4 As was the case in all the previous work we described, we only obtain a pseudo-approximation for the balanced 
separator problem. Recall that an a-pseudo-approximation for the balanced separator problem with balance constant 
c is a c'-balanced cut C - for some other constant c' - such that C's sparsity is within a of the sparsest c-balanced cut. 
For the sake of convenience, in the rest of the paper we don't differentiate between pseudo- and true approximation. 
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dominated by 0(m) all-pair shortest path computations and is thus prohibitively large from our 
point of view - see section IBTTl for more details. 

To circumvent this problem, we design an alternative and more general graph decomposition 
(see Theorem 13. 6p , Similarly to the case of Racke's decomposition, our construction is based 
on embedding graph metrics into tree metrics. However - instead of the embedding result of 
Fakcharoenphol, Talwar, and Rao [23] that was used by Racke - we employ the nearly-linear time 
algorithm of Abraham, Bartal, and Neiman pQ for finding low-average-stretch spanning trees. This 
choice allows for a much more efficient implementation of our decomposition procedure at a cost 
of decreasing the quality of provided approximation from O(logra) to O(logn). Also, even more 
crucially, inspired by the notion of the ultrasparsifiers of Spielman and Teng [40 \ I41j . we allow 
flexibility in choosing the type of graphs into which our graph is decomposed. In this way, we are 
able to offer a trade-off between the structural simplicity of these graphs and the time needed to 
compute the corresponding decomposition. 

Finally, by recursive use of our decomposition - together with sparsification technique - we are 
able to leverage the above-mentioned flexibility to design a procedure that can have its running 
time be arbitrarily close to nearly-linear and, informally speaking, allows us to sample from a 
decomposition of our input graph G into graphs whose structure is arbitrarily simple, but at a cost 
of getting proportionally worse quality of the reflection of the cut structure of G ~ see Theorem 13.71 
for more details. 

1.5 Outline of the Paper 

We start with some preliminaries in section [2j Next, in section [3l we introduce the key concepts 
of our paper and state the main theorems. In section [4] we show how our framework leads to 
fast approximation algorithms for undirected cut-based minimization problems. We also construct 
there our algorithms for the (generalized) sparsest cut and balanced separator problems and remark 
on extending our framework to make it handle multicut-based minimization problems. Section [5] 
contains the description of our decomposition procedure that allows expressing general graphs as 
a convex combinations of simpler ones while approximately preserving cut-flow structure. We 
conclude in section [6] with showing how a recursive use of this decomposition procedure together 
with sparsification leads to the sampling procedure underlying our framework. 

2 Preliminaries 

In this paper we will be concerned with an undirected graphs G = (V, E, u) having vertex set V, 
edge set E, and integer capacities u : E — )■ Z + on the edges. By capacity ratio U of G we mean the 
maximum possible ratio between capacities of two edges of G i.e. U := max e e / g e u(e)/u(e'). 

For a given graph G = (V,E,u), by a cut C in G we mean any vertex set ^ C C V. We 
denote by E(C) the set of all the edges of G with exactly one endpoint in C - we say that these 
edges are cut by C. By C we will denote the set V \ C. Also, we define the capacity u(C) of a cut 
in G to be the total capacity u{E{C)) of all the edges in E(C). Finally, for some V C V, we say 
that a subgraph G 1 = (V , E' ,u') is a subgraph of G induced by V' if E' consists of all the edges of 
G whose both endpoints are in V' and the capacity function v! on these edges is inherited from the 
capacity function u of G. 
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2.1 Maximum Concurrent Flow Problem 

For a given graph G = (V, E, u), by a multicommodity flow f = (f 1 , . . . , f k ) we mean a set of k 
flows f % in G, where each flow f l routes a commodity i from some source Sj to some smfc ij. Let us 
also define |/(e)| as the total flow routed along the edge e by /, i.e. |/(e)| := ^ We say 

that a multicommodity flow / is feasible if for every edge e, |/(e)| < u(e). 

One of the most popular multicommodity flow problems is the maximum concurrent flow prob- 
lem. In this problem, in addition to the graph G = (V,E,u), we are given a demand graph 
D = (V, Ed, d). The objective is to find a feasible multicommodity flow / in G such that for each 
demand edge e = (v, w) £ Ed there is a flow f e that routes 0d(e) units of corresponding commodity 
from v to w and the flow rate 6 is maximized. Another important flow problem - that can be viewed 
as a special case of the maximum concurrent flow problem - is the maximum s-t flow problem (or 
just the maximum flow problem) that corresponds to a task of finding a feasible single-commodity 
flow that maximizes the throughput being the amount of flow pushed from the source s to the sink 
t. 

2.2 Embedability 

A notion we will be dealing extensively with is the notion of graph embedding. 

Definition 2.1. For given graphs G = (V,E,u) and G = (V,E,u), by an embedding f of G into 
G we mean a multicommodity flow f = (f ei , ■ ■ ■ , / e,B| ) in G with \E\ commodities indexed by the 
edges of G, such that for each 1 < % < \E\, the flow f e ' routes in G u(ei) units of flow between 
endpoints ofei. 

One may view an embedding / of G into G as a concurrent flow in G whose source-sink pairs and 
corresponding demands are given by the endpoints of edges of G and their corresponding capacities. 
Note that the above definition does not require that / is feasible. We proceed to the definition of 
embedability. 

Definition 2.2. For given t > 1, graphs G = (V,E,u), and G = (V,E,u), we say that G is t- 
embeddable into G if there exists an embedding f ofG into G such that for all e 6 E, |/(e)| < fu(e). 
We say that G is embeddable into G if G is 1-embeddable into G. 

Intuitively, the fact that G is t-embeddable into G means that we can fractionally pack all the 
edges of G into G with all its capacities u multiplied by t. Also, it is easy to see that for given 
graphs G and G, one can always find the smallest possible t such that G is t-embeddable into G by 
just solving maximum concurrent flow problem in G in which we treat G as a demand graph with 
the demands given by its capacities. 

2.3 Graph Sparsification 

A powerful tool in designing fast algorithms for cut-related problems is graph sparsification - this 
procedure allows approximation of all the cuts of any - possibly dense - graph by its sparse subgraph 
with appropriately chosen capacities. In particular, the following theorem was proved by Benczur 
and Karger [H] (see also [13], [39], and [13] for more general results on sparsification). 

Theorem 2.3 ([33]). Given graph G = (V,E,u) and an accuracy parameter 5 > 0, there is a 
Monte Carlo algorithm that finds in 0(\E\) time a subgraph G = (V,E,u) of G such that 
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(i) G has 0{\V\\og\V\5~ 2 ) edges; 



(ii) for any cut 7^ C C V of G, we have u{C) < u(C) < (1 + 5)u(C) i.e. the cuts of G are 
(1 + 5) -approximately preserved in G; 

(Hi) the capacity ratio of G is at most 0(\V\) times the capacity ratio of G. 

3 Graph Decompositions and Fast Approximation Algorithms for 
Cut-based Problems 

The central concept of our paper is the notion of (a, ^-decomposition. This notion is a general- 
ization of the cut-based graph decomposition introduced by Racke [36] . 

Definition 3.1. For any a > 1 and some family of graphs Q, by an (a, (^-decomposition of a 

graph G = (V, E, u) we mean a set of pairs {(Aj, Gi)}i, where for each i, Aj > and Gi = (V, E, Ui) 
is a graph in Q, such that: 

(a) E,A, = 1 

(b) G is embeddable into each Gi 

(c) There exist embeddings fi of each Gi into G such that for each e € E, Y2i Aj|/i(e)| < cra(e). 

Moreover, we say that such decomposition is /c-sparse if it consists of at most k different Gi . 

The crucial property of (a, ^-decomposition is that it captures a-approximately the cut struc- 
ture of the graph G. We formalize this statement in the following easy to prove fact. 

Fact 3.2. For any a > 1, graph G = (V,E,u), and a family of graphs Q, if {(Aj,Grj)}j is an 
(a, Q)- decomposition of G then for any cut C of G: 

(lowerbounding) for all i, the capacity Ui{C) of C in Gi is at least its capacity u(C) in G; 
(upperbounding in expectation) E^[u(C)] := ^2n\ui{C) < au{C). 

Note that the condition (jaj) from Definition 13. II implies that {(Aj, Gi)}i is a convex combination 
of the graphs {G{\i. Therefore, one of the implications of the Fact 13.21 is that for any cut C in G, 
not only the capacity of C in every Gi is lowerbounded by its capacity u(C) in G, but also there 
always exists a graph Gj in which the capacity of C is at most cra(C). As one can easily convince 
oneself, for any (3 > 1, this property alone allows to reduce a task of a/3-approximation of any 
cut-based minimization problem in G to a task of /3-approximating this problem in each of GjS. 

In fact, if one is willing to settle for Monte Carlo-type approximation guarantees, one can reduce 
the task of approximation of such problem to a task of approximating it in only a small number of 
GjS. To make this statement precise, let us introduce the following definition. 

Definition 3.3. For given G = (V,E,u), a > 1, and 1 > p > 0, we say that a collection {G{\i of 
random graph^ Gi = (V,Ei,Ui), a-preserves the cuts of G with probability p if for every cut C of 
G: 

5 We formalize the notion of random graphs by viewing each Gi as a graph on vertex set V chosen according to 
some underlying distribution over all such graphs. 
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(lowerbounding) for alii, the capacity Ui(C) of C in G{ is at least its capacity u{C) in G; 

(probabilistic upperbounding) with probability at least p, there exists i such that Ui(C) < 
au(C). 

With a slight abuse of notation, we will say that a random graph G' is a-preserving the cuts of G 
with probability p if the corresponding singleton family {G'} is doing it. Now, a useful connection 
between (a, {^-decomposition of graph G and obtaining graphs 0(a)-preserving the cuts of G with 
some probability is given by the following fact whose proof is a straight-forward application of 
Markov's inequality 

Fact 3.4. Let G = (V,E,u), a > 1, 1 > p > and let {(Aj,Gj)}j be some (a, Q) -decomposition of 
G. If G' is a random graph chosen from the set {Gi}i according to distribution given by \{S i.e. 
Pr[G' = Gi] = Xi, then G' la-preserves cuts of G with probability 1/2. 

Therefore - as we will formally prove later - for any (3 > 1, we can obtain a Monte Carlo 
2a/3-approximation algorithm for any minimization cut-based problem in G, by /3-approximating 
it in a sample of 0(ln \V\) GjS that were chosen according to distribution described by A.;s. 

3.1 Finding a Good (a, (^-decomposition Efficiently 

In the light of the above discussion, we focus our attention on the task of developing a (a,G)- 
decomposition of graphs into some family Q of structurally simple graphs that enjoys relatively 
small value of a. Of course, since our emphasis is on obtaining algorithms that are very efficient, 
an important aspect of the decomposition that we will be interested in is the time needed to compute 
it. 

With this goal in mind, we start by considering the following theorem due to Racke [36j that 
describes the quality of decomposition one can achieve when decomposing the graph G into a family 
Qt of its decomposition trees (cf. [36] for a formal definition of a decomposition tree). 

Theorem 3.5 (|36j). For any graph G = (V,E,u), an 0{m)-sparse (O (log n),Qx)- decomposition 
of G can be found in polynomial time, where n = \ V\ and m = \E\. 

Due to structural simplicity of decomposition trees, as well as, the quality of cut preservation 
being the best - up to a constant - achievable in this context, this theorem was used to design 
good approximation algorithms for a number of cut-based minimization problems. 

Unfortunately, from our point of view, the usefulness of Racke's decomposition is severely 
limited by the fact that the time needed to construct it is not acceptable when one is aiming 
at obtaining fast algorithms. More precisely, the running time of the decomposing algorithm of 
|36j is dominated by 0(m) executions of the algorithm of Fakcharoenphol, Talwar and Rao |23] 
that solves the minimum communication cost tree problem with respect to a cost function devised 
from the graph G (cf. [36] for details). Each such execution requires, in particular, computation 
of the shortest-path metric in G with respect to some length function, which employs the well- 
known all-pair shortest path algorithm \25 \ |4* H [2] running in 0(min{mn, n 2 6 }) time. This results 
in - entirely acceptable when one is just interested in obtaining polynomial-time approximation 
algorithms, but prohibitive in our case - 0(m min{ mn, n 2,376 }) total running time. 

One could try to obtain a faster implementation of Racke's decomposition by using a nearly- 
linear time low-average-stretch spanning tree construction due to Abraham, Bartal, and Neiman 
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PQ in place of the algorithm of [23] . This would lead to an 0(m 2 ) time decomposition of G into 
its spanning trees that has a - slightly worse - quality of O(logre) instead of the previous O(logn). 
However, although we will use [1] in our construction (cf. section 15 . 2[) . 0(m 2 ) time is still not 
sufficiently fast for our purposes. 

Our key idea for circumventing this running time bottleneck is allowing ourselves more flexibility 
in the choice of the family of graphs into which we will decompose G. Namely, we will be considering 
(a, (^-decompositions of G into objects that are still structurally simpler than G, but not as simple 
as trees. 




Figure 2: An example of a j-tree H, its core H', and its envelope F consisting of bold edges. 

To this end, for j > 1, we say that a graph H = (Vh, Ejj,uh) is a j-tree (cf. Figure [2]) if it is a 
connected graph being a union of: a subgraph H' of H induced by some vertex set V' H C Vh with 
< j; and of a forest F on Vh whose each connected component has exactly one vertex in V H . 
We will call the subgraph H' the core of H and the forest F the envelope of H£] 

Now, if we define Gv\j] to be the family of all the j-trees on the vertex set V, the following 
theorem - being the heart of our framework - holds. Its proof appears in section 

Theorem 3.6. For any graph G = (V,E,u) and t > 1, we can find in time 0{tm) a t-sparse 
(0(logn),Gv[0( 7 -^^-)])-decomposition {(Aj,Gj)}j of G, where m = \E\, n = \V\, and U is the 
capacity ratio of G. Moreover, the capacity ratio of each Gi is 0(mU). 

Intuitively, the above theorem shows that if we allow j = 0( ml ° g ^ ) to grow, the sparsity of 
the corresponding decomposition of G into j-trees - and thus the time needed to compute it - will 
decrease proportionally!!] 

Note that a 1-tree is just an ordinary tree, so by taking t in the above theorem sufficiently large, 
we obtain a decomposition of G into trees in time 0(m 2 log U) If] Therefore, we see that compared 

6 Note that given a j-tree we can find its envelope in linear time. Therefore, throughout the paper we always 
assume that the j-trees we are dealing with have their envelopes and cores explicitly marked. 

interestingly, such a trade-off is somewhat reminiscent of the trade-off achieved by Spielman and Teng [401 141 1 
between the number of additional edges of an ultrasparsifier and the quality of spectral approximation provided by 
it. 

8 In fact, one might show that if our goal is to decompose G into trees then the running time of our algorithm is 
just 0(m 2 ). 
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to the decomposition result of Racke (cf. Theorem 13. 5|) . we obtain in this case a decomposition of 
G with more efficient implementation and into objects that are even simpler than decomposition 
trees, but at a cost of slightly worse quality. 

However, the aspect of this theorem that we will find most useful is the above-mentioned trade- 
off between the simplicity of the j-trees into which the decomposition decomposes G and the time 
needed to compute it. This flexibility in the choice of t can be utilized in various ways. 

For instance, one should note that, in some sense, the core of a j-tree H captures all the non- 
trivial cut-structure of H. In particular, it is easy to see that maximum flow computations in 
H essentially reduce to maximum flow computations in H's core. So, one might hope that for 
some cut-based problems the complexity of solving them in H is proportional to the complexity of 
solving them in the - possibly much smaller than H - core of H (this is, for example, indeed the 
case for the balanced separator and the sparsest cut problems - see section I4.3() . Thus one could 
use Theorem l3.6l - for appropriate choice of t - to get a faster algorithm for this kind of problems by 
just computing first the corresponding decomposition and then leveraging the existing algorithms to 
solve the given problem on a small number of sampled j-trees, while paying an additional O(logn) 
factor in the approximation quality for this speed-up. 

Even more importantly, our ability to choose in Theorem 13.61 sufficiently small value of t, as 
well as, the sparsification technique (cf. Theorem I2.3[) and recursive application of the theorem 
to the cores of 0( ml ° gt/ )-trees sampled from the computed decomposition, allows establishing the 
following theorem - its proof appears in section [6J 

Theorem 3.7. For any 1 > I > 0, integral k > 1, and any graph G = (V,E,u), we can find in 

0{m + 2 k n- 1+ i k -^ \ogU) time a collection of (2 k+1 Inn) n l -trees {Gi}i that (log( 1+ °( 1 ^ fc n)-preserve 
the cuts of G with high probability. Moreover, the capacity ratio of each Gi is n ty2+ °^ k U , where U 
is the capacity ratio of G, n = \V\, and m = \E\. 

As one can see, the above theorem allows obtaining a collection of j-trees that a-preserve cuts 
of G — for arbitrary j = n l - in time arbitrarily close to nearly-linear, but at a price of a growing 
accordingly as these two parameters decrease. 

Note that one can get a cut-preserving collection satisfying the requirements of the theorem by 
just finding an (0(log n), n')-decomposition of G via Theorem 13.61 and sampling - in the spirit of 
Fact 13.41 - O(logn) n'-trees from it so as to ensure that each cut is preserved with high probability. 
Unfortunately, the running time of such procedure would be too large. Therefore, our approach to 
establishing Theorem 13.71 can be heuristically viewed as an algorithm that in case when the time 
required by Theorem 13.61 to compute a decomposition of G into n'-trees is not acceptable, does not 
try to compute and sample from such decomposition directly. Instead, it performs its sampling 
by finding a decomposition of G into j-trees, for some value of j bigger than n l , then samples a 
j-tree from it and recurses on this sample. Now, the desired collection of n'-trees is obtained by 
repeating this sampling procedure enough times to make sure that the cut preserving probability is 
high enough, with the value of j chosen so as to bound the number of recursive calls by k — 1. Since 
each recursive call introduces an additional O(logn) distortion in the faithfulness of the reflection 
of the cut structure of G, the quality of the cut preservation of the collection generated via such 
algorithm will be bounded by log^ 1+ °^ k n. 



11 



4 Applications 



We proceed to demonstrating how the tools and ideas presented in the previous section lead to a 
fast approximation algorithms for cut-based graph problems. In particular, we apply our techniques 
to the (generalized) sparsest cut and the balanced separator problems. Later we remark on the fact 
that our framework is also applicable to multicut-based minimization problems. 
The following theorem encapsulates our way of employing our framework. 

Theorem 4.1. For any a > 1, integral k > 1, 1 > I > 0, and undirected graph G = (V,E,u) with 

n = \V\, m = \E\, and U being its capacity ratio, we can find in 0{m + 2 k n 2k ~ 1 \ogU) time, a 
collection of 2 k+1 Inn n l -trees {G{}i, such that for any instance P of any cut-based minimization 
problem V the following holds with high probability. If {C,*}?, is a collection of cuts of G with 
each C* being some a-optimal solution to P on the n l '-tree G{, then at least one of C* is an 
(a log( 1+ °( 1 ^ fc n) -optimal solution to P on the graph G. 

When we take I equal to zero, the above theorem implies that if we have an a-approximation 
algorithm for a given cut-based problem on trees that runs in T(m, n, U) time then, for any k > 1, we 
can get an (a log^ 1+0 ^^ fc n)-approximation algorithm for it in general graphs and the running time 
of this algorithm will be just 0(m + k2 k n^ 1+1 ^ 2k ~ r> log U) + (2 k+l hxn)T(m, n, U). Note that the 
computational overhead introduced by our framework, as k grows, quickly approaches nearly-linear. 
Therefore, if we are interested in designing fast poly-logaritmic approximation algorithms for some 
cut-based minimization problem, we can just focus our attention on finding a fast approximation 
algorithm for its tree instances. 

Also, an interesting feature of our theorem is that the procedure producing the graphs {Gi}i is 
completely oblivious to the cut-based problem we want to solve - the fact that this problem is a 
cut-based minimization problem is all we need to make our approach work. 

Proof of Theorem \4-l\ We produce the desired collection {G{\i by just employing Theorem 13.71 

with the desired value of k > 1 to find in time 0(m + 2 k n 2k - 1 \ogU) a set of t = (2 k+1 Inn) 
n'-trees that (log^ 1+o( - 1 ^ fc n)-preserve the cuts of G with high probability and output them as {Gi}{. 

Now, to prove the theorem, let C* be the cut that minimizes the quantity u(C*)fp(C*) among 
all the t solutions C*, . . . , C t * found, where fp is the function corresponding to the instance P of a 
problem V we are solving (cf. equation ([I])). Clearly, by definition of (log^ 1+ °^ 1 ^ fc n)-preservation 
of the cuts, the capacity of C* - and thus the quality of the solution corresponding to it - can only 
improve in G i.e. 

u(C*)fp(C*) < Uj (C*)f P (C*), 

where Gj is the n^tree to which C* corresponds and, for any i, Ui denotes the capacity function of 
Gi. 

Moreover, if we look at an optimum solution G OPT to our problem in G - i.e. C OPT = 
arg mm^Qf^y u(C)fp(C) - then, with high probability, in at least one of the GjS, say in 
has the capacity at most (log^ 1+ °^ k n) times larger than its original capacity u(C OPT ) in G. As 
a result, for the a-optimal solution C*, found by the algorithm in this Gj/ it will be the case that 

u(C*,)f P (Cp) < u f (C*,)f P (Cp) < au f (C OPT )f P (C OPT ) < a(log( 1+0 «) fc n)u(C OPT )f P (C OPT ), 
where the first inequality follows from the fact that {Gi}i (log^ 1+ °^ 1 ^ fc n)-preserve cuts of G. 
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But, by definition of C* , u(C*)f P (C*) < u(C*,)f P {C*,) and - by noting that u(C OPT )f P (C OPT ) 
is the objective value of an optimal solution to our instance - we get that C* is indeed an 
(a log( 1+ °^ 1 ^ fc n)-optimal solution to P with high probability. 

4.1 Computing Maximum Concurrent Flow Rate on Trees 

As Theorem 14.11 suggests, we should focus on designing fast approximation algorithms for tree 
instances of our problems. The basic tool we will use in this task is the ability to compute maximum 
concurrent flow rate on trees in nearly-linear time. The main reason why such a fast algorithm exists 
stems from the fact that if we have a demand graph D = (y,Eu,d) and a tree T = (V,Et,ut), 
there is a unique way of satisfying these demands in T. Namely, for each demand edge e G Ed the 
flow of d(e) units of corresponding commodity has to be routed along the unique path path r (e) 
joining two endpoints of e in the tree T. As a result, we know that if we want to route in T a 
concurrent flow of rate 9 = 1 then the total amount of flow flowing through a particular edge h of 
T is equal to 

u T [D](h) := £ d(e). 

eei?D,/i€path T (e) 

Interestingly, we can compute u T [D](h) for all h G Et in nearly-linear time. 

Lemma 4.2. For any tree T = (V,Et,ih) and demand graph D = (V,Er),d), we can compute 
u T [D)(h), for all h G E T , in 0{\E D \ + \V\) time. 

Proof: To compute all the values u T [D](h), we adapt the approach to computing the stretch of 
edges outlined by Spielman-Teng in [4T] , 

For a given tree T' = (V , Et>, Ut>) and a demand graph D' = (V',Ejjf,d') corresponding to it, 
let us define the size Sd'(T') of T' (with respect to D') as Sd'(T') := YlveV'O- + dD'{v)), where 
df)'(v) is the degree of the vertex v in D'. Let us define a splitter v* D ,{T') of T' (with respect to D') 
to be a vertex of T' such that each of the trees T[,...,T' obtained from T' by removal of v D ,(T') 
and all the adjacent edges has its size Sd>(T{) being at most one half of the size of T' i.e. 

Sd'{TI) < S D >(T')/2 = \E D >\ + \V'\/2, 

for each i. It is easy to see that such a splitter v*(T') can be computed in a greedy fashion in 
0(\V'\) = 0(S D >(T')) time. 

Our algorithm for computing u T [D]s works as follows. It starts with finding a splitter v* = 
v* D {T) of T. Now, let Eq be the set of demand edges e € Ed such that path-p(e) contains v* . Also, 
for 1 < % < q, let E{ C Ed be the set of edges e £ D for which path r (e) is contained entirely within 
Tj. Note that this partition of edges can be easily computed in 0{\Ed\ + |V|) = 0(Sd'(T')) time. 
Also, let us define Di to be the demand graph being the subgraph of D spanned by the demand 
edges from Ei (with demands inherited from D). 

As a next step, our algorithm computes in a simple bottom-up fashion u t [Dq\{K) for each 
h G Et- Then, for 1 < i < q, it computes recursively the capacities u Tl [Di] of all the edges of 
Tj - note that, by definition of D{, u Ti [Di](h) = u T [Di](h) for any edge h of Tj. Finally, for each 
h G Et, we output ur[D](h) = ^ u T [Di]{h). Note that, since for i > 1, u T [Di](h) ^ only if h is 
a part of Tj, the sum ^ i u 1 [Di](h) has at most two non-zero components and thus all the values 
u T [D]{h) can be computed in 0(\Ed \ + \V\) = 0(Sd{T)) time. Furthermore, the fact that, by our 
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choice of v*, Sp^Ti) < Sx>(T)/2 implies that the depth of recursion is at most log Sd{T) and the 
whole algorithm runs in 0(Sd(T)) = 0(\Ejy\ + \V\) time, as desired. 

I 

Now, the crucial thing to notice is that the best achievable flow rate 9* of the maximum 
concurrent flow is equal to min^g^j, u tJ^^ ■ Therefore, Lemma [4.21 implies the following corollary. 

Corollary 4.3. For any tree T = (V,Et,ut) and demand graph D = (V, Erj,d), we can find in 
0{\Ed\ + | ) time the optimal maximum concurrent flow rate 9* and an edge h* £ Et such that 

a* _ u T {h*) 
V ~ uT[D](h*)- 

Note that we only compute the optimal flow rate of the concurrent flow and not the actual flows. 
In some sense, this is unavoidable - one can easily construct an example of maximum concurrent 
flow problem on tree, where the representation of any (even only approximately) optimal flow has 
size n(\E D \\V\). 

4.2 Generalized Sparsest Cut Problem 

We proceed to designing a fast approximation algorithm for the generalized sparsest cut problem. 
We start by noticing that for a given tree T = (V,Et,ut), demand graph D = (V,Erj,d), and 
an edge h of this tree, the quantity Jrjjy^h) ls exac tly the (generalized) sparsity of the cut that 
cuts in T only the edge h. Therefore, Corollary 14.31 together with the fact that the sparsity of the 
sparsest cut is always an upper bound on the maximum flow rate achievable, gives us the following 
corollary. 

Corollary 4.4. For any given tree T = (V,E,u) and demand graph D = (V,E£>,d), an optimal 
solution to the generalized sparsest cut problem can be computed in 0(\Er)\ + |^|) time. 

Now, by applying Theorem 14. II together with a preprocessing step of sparsification of D, we are 
able to obtain a poly-logarithmic approximation for the generalized sparsest cut problem in time 
close to linear. 

Theorem 4.5. For any graph G = (V,E,u), demand graph D = (V, E£>,d), and integral k > 
1, there exists a Monte Carlo log^ 1 " 1 "^ 1 )^ n- approximation algorithm for generalized sparsest cut 
problem that runs in time 0(m + \En\ + 2 k n <yl+l ^ 2 \ogU), where n = \V\, m = \E\, and U is 
the capacity ratio of G. 

Proof: We start by employing Theorem 12.31 with 5 equal to 1, to sparsify both G - to obtain a 
graph G = (V, E, u)) - and the demand graph D - to obtain a demand graph D = (V, Er>,d) - in 
total time of 0(m + |-Ed|)- Note that computing sparsity of a cut with respect to these sparsified 
versions of G and D leads to a 4-approximate estimate of the real sparsity of that cut. Since this 
constant-factor error is acceptable for our purposes, we can focus on approximating the sparsest cut 
problem with respect to G and D. To this end, we just use Theorem 14.11 together with Corollary 
14.41 and, since both and \Ed\ have 0{n) edges, our theorem follows. 

4.3 Balanced Separator and Sparsest Cut Problem 

We turn our attention to the balanced separator problem. Analogously to the case of the generalized 
sparsest cut problem above, to employ our approach we need an efficient algorithm for the tree 
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instances of the balanced separator problem. Unfortunately, although we can solve this problem 
on trees optimally via dynamic programming, there seems to be no algorithm that does it very 
efficiently - ideally, in nearly-linear time. Therefore, we circumvent this problem by settling for a 
fast but approximate solution. 

Namely, we use the result of Sherman [37] who shows that, for any e > 0, the balanced separator 
problem - as well as the sparsest cut problem - can be 0(y / logn/e)-approximated in a graph G - 
with n vertices and m edges - in time 0(m + n 3 / 2+e ). This running time corresponds to sparsifying 
G and then using the fastest known algorithm for the maximum flow problem due to Goldberg and 
Rao [27J to perform maximum flow computations^ on a sequence of n £ graphs that are derived in 
a certain way from the sparsified version of G. 

Unfortunately, 0(m + n 3 / 2+£ ) running time is still not sufficiently fast for our purposes. At 
this point, however, we recall that maximum flow computation in a j-tree reduces to the task of 
finding the maximum flow in its core. So, if we want to perform a maximum flow computation on 
a j-tree that has its core sparsified (i.e. the core has only 0(j) edges), the real complexity of this 
task is proportional to j as opposed to being proportional to n. This motivates us to obtaining an 
implementation of Sherman's algorithm on j-trees that achieves better running time. The proof of 
the following lemma appears in Appendix IA.1[ 

Lemma 4.6. For any j-tree G = (V,E,u) and e > 0, we can 0(y/logn/e)- approximate the 
balanced separator and the sparsest cut problems in 0(m + n e (n + j 3 / 2 )) time, where m = \E\ and 
n = \V\. 

Before we proceed further, we note the following lemma whose proof is presented in Appendix 

Lemma 4.7. For any graph G = (V,E,u), we can find a \V\ 2 -approximation to the sparsest cut 
and the balanced separator problems in time 0(\E\). 

Now, we can use Theorem 14.11 and Lemma 14.61 - for the right choice of j = n - together with 
a simple preprocessing making the capacity ratio of the graphs we are dealing with polynomially 
bounded, to obtain the following result. 

Theorem 4.8. For any e > 0, integral k > 1, and graph G = (V, E, u), we can (\og( 1+ °W)( k+1 l 2 ) n/yfe)- 

approximate the sparsest cut and the balanced separator problems in time 0(m + 2 fc n 1+3 ' 2fc - 1+e ), 
where m = \E\ and n = \V\. 

Proof: Let us assume first that the capacity ratio of G is polynomially bounded i.e. it is n°^\ In 
this case, we just use Theorem 14. II on G with I = ^ 2 ^-i ^° °bt a rn a collection of (2 fe+1 Inn) n'-trees 
{Gi}i in time 

0(m + 2 fc n (1+ ^T ) ) = 0(m + 2 k n {l+ ^^ ) ). 

technically, when the input graph has 0(n) edges, the algorithm of Goldberg and Rao finds in time 0(n 3 ^ 2 log l/<5) 
an integral s-t flow and a s-t cut with values of respective throughput and capacity being within (1 + S) of each other. 
So, to ensure that this flow is optimal, one needs to use <5 smaller than the inverse of the value of minimum s-t cut 
of the graph. In principle, this value could be as large as Cl(nU) which would result in 0(n 3 ^ 2 log U) running time. 
However, the graphs that are considered in Sherman's algorithm have the value of minimum s-t cut always bounded 
by n/2. 
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Next, we compute - using the algorithm from Lemma [4.6l - O ( y^og n / ^-approximately optimal 
solutions to our desired problem - being either the sparsest cut or the balanced separator problem 
- on each of GjS in total time of 

0(m + {2 k+1 \nn)n £ {n + n 3l/2 )) = 0{m + 2 k n + ^^ +£ ). 

By Theorem 14. II we know that choosing the best one among these solutions will give, with high 
probability, a (\og^ 1+ °^^ k+1 ^ n/- v /e)-approximately optimal solution that we are seeking. 

To ensure that G has its capacity ratio always polynomially bounded, we devise the following 
pre-processing procedure. First we use Lemma 14.71 to find a cut C being n 2 -approximation to the 
optimal solution of our desired problem. Let £ be the sparsity of C. We remove all the edges in G 
that have capacity smaller than (,/mn 2 and, for all the edge with capacity bigger than £n, we trim 
their capacity to Qn. Clearly, the resulting graph G' has its capacity ratio polynomially bounded. 
Also, the capacity of any cut in G can only decrease in G' . 

Now, we just run the approximation algorithm described above on G 1 instead of G. Also, in the 
case of approximating the balanced separator problem, we set as our balance constant the value 
c" := min{c, c'}, where d := min{|C|, |C|}/n and c is the balance constant of our input instance. 
Since the capacity ratio of G' is polynomially bounded, the running time of this algorithm will 
be as desired. Moreover, we can assume that the a-approximately optimal cut C output will 
have sparsity at most ( in G' - otherwise, we can just output C as our solution. This means 
that C does not cut any edges with trimmed capacity and thus the capacity of C" in G (after 
putting the removed edges back) can be only by an additive factor of Q/n 2 larger than in G' . But 
since C^jn 2 is a lower bound on the sparsity of the optimal solution, we see that C is our desired 
(log( 1+0 ( 1 "( fc+1 ' 2 ) n/y / e : )-approximately optimal solution. 

Recall that our main motivation to establishing Lemma 14.61 was our inability to solve the bal- 
anced separator problem on trees efficiently. However, the obtained trade-off between the running 
time of the resulting algorithm and the quality of the approximation provided for both the balanced 
separator and the sparsest cut problems is much better than the one we got for the generalized 
sparsest cut problem (cf. Theorem 14. 5p . This shows us that sometimes it is beneficial to take 
advantage of the flexibility in the choice of I given by Theorem l4.1l by combining it with an existing 
fast approximation algorithm for our cut-based problem that allows a faster implementation on 
tree instances. 

4.4 Extension to Multicut-based Problems 

We briefly remark on the applicability of our framework to a class of undirected multicut-based 
minimization problems which generalize the undirected cut-based minimization ones we consider in 
the paper. To this end, let us define a problem V to be an (undirected) multicut-based (minimiza- 
tion) problem if its every instance P £ V on a graph G = (V, E, u) can be cast as a task of finding 
a partition {C*}^ of vertices of V that minimizes the quantity u({Ci} k =1 ) fp({Ci} k =1 ) over all 
partitions {Ci] k =l of the vertex set V. Here, u({Ci} k =1 ) is the total capacity of all the edges of 
G whose endpoints are in different CjS. Once again, we require fp to be a non-negative function 
that can depend on P, but does not depend on the graph G. Note that in our definition we do not 
restrict a priori the number of sets into which V is partitioned in considered partitions - this can be 
however specified by appropriate choice of the function fp. Two important examples of problems 
that are captured by this definition are the multiway cut problem (cf . [20l ETj [TBI [HI [29] ) and the 
multicut problem (cf. |26j). 



16 



To see why our framework can be extended to handle multicut-based problems, note that for 
any partition {Cj}jL 1 , we can express the total capacity of edges cut by it as a linear combination 
of the capacities of the cuts corresponding to each set Cj. Namely, u({Cj}^ =1 ) = \ Z)t=i 
Therefore, if G is our input graph then any graph that approximates the cuts of G also approximates 
its multicuts. This implies, in particular, that sparsification (cf. Theorem I2.3|) preserves the multi- 
cuts up to a factor of (1 + 5). Furthermore, by linearity of expectation, the analogs of Fact 13.21 and 
13.41 hold for multicuts as well. One can check, however, that this is all that we need to prove an 
extension of Theorem 14.11 that handles minimization multicut-based problems. 

5 Proof of Theorem 13.61 

Let us fix throughout this section t > 1, the graph G = (E,V,u) to be (O(logra), Qy [0( ml ° t gU )])- 
decomposed, m = \E\, n = |V|, and U equal to the capacity ratio of G. Our proof of Theo- 
rem 13.61 consists of two main steps. First, we show how to quickly decompose G into a i-sparse 
(d(logn),%[0(^M)])-decomposition {(A;, where is a family of graphs that we will 

define shortly. Then, as a second step, we prove that for any graph H S T~i[j] we can efficiently 
find a graph G E 0v[O(j)] (i.e. a graph G being a O(j)-tree) such that H is embeddable into G 
and G is 9-embeddable into H. This will imply that both these graphs are equivalent (up to a 
constant) with respect to their cut-flow structure. As a result, if we consider a decomposition of G 
into convex combination {(Aj,Gj)}j, where each graph Gi is the graph from Qv[0{j)} equivalent - 
in the above sense - to the graph Hi G H\j], then we will be able to show that this constitutes the 
t-sparse (0(logn),g y [0(^g^)])-decomposition of G that we are seeking. 

5.1 Graphs H(T, F) and the Family U[j] 

To define the family H[j], consider some spanning tree T = (V, E?) of G. There is a unique way of 
embedding G into T. Namely, for each edge e = (u,v) of G we route the corresponding u(e) units 
of flow along the unique u-v path path r (e) in T. This implies that if we want G to be embeddable 
into T then each edge e of the tree T has to have capacity of at least u T (e), where we define 

u T (e) := £ u(e'). 

e'G-E:e£path T (e') 

Now, for a subset of edges F C Et, let us define 

E[T](F) :={eeE : path T (e) n F ^ 0}. 

Finally, let H(T, F) = (V,E,u) be a graph (cf. Figure ED with edge set E := E T UE[T}(F) and 
the capacities u(e), for e G E, being equal to: 

fu(e) if e G E[T](F) 
I u ye) otherwise. 

In other words, H (T, F) is a graph obtained by taking the forest corresponding to the tree T 
with edges of F removed and adding to it all the edges that are in the set E[T](F) containing the 
edges of E whose endpoints are in different components of this forest (note that F C E[T](F)). 
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Figure 3: a) An example of a graph G (solid edges), its spanning tree T (bold edges), and a subset 
F of edges of T (dotted edges), b) Graph H(T,F) corresponding to the example from a). The 
edges of T \ F are bold and the edges of the set E[T](F) are dashed. 

The capacity u(e) of an edge e of H (T, F) is equal to the capacity u T (e) inherited from the tree T 
- if e is from the forest T \F; and it is just the original capacity u(e) of e otherwise. 

An observation that will turn out to be useful later is that both the capacity function u T and 
the graph H(T, F) can be constructed very efficiently - cf. Appendix IB. II for the proof of the 
following lemma. 

Lemma 5.1. Given a spanning tree T = (V,Et) of G and some F C Et, we can compute u T (e) 
for all e E Et and construct the graph H(T, F) in 0{m) time. 

Now, we define H[j] to be the family of all the graphs H(T,F) arising from all the possible 
choices of a spanning tree T of G and of a subset F of edges of T such that \F\ < j. 

5.2 Obtaining an (O(logn), %[0(^^)])-decomposition of G 

For a given length function / on G and a subgraph H = (V, Eh ,uh) of G, let us define the 
volume 1(H) of H (with respect to I) to be 1(H) := Yl e eE H K e ) u fi(e)- Also, for an edge e G Eh, let 
7ff(e) := uu(e) an< ^ us denote by 'j(H) the minimum value of 7# (e), i.e. j(H) := mm e& E H 1h(&)- 
Furthermore, let us define a set k(H) as 

k(H) := {e€E H : 7ff(e) < 2 7 (H)}. 

Intuitively, j(H) corresponds to the inverse of maximal congestion incurred on edges of G by 
an identity embedding of H into G - this identity embedding just routes the flow corresponding to 
given edge e of H along its counterpart edge in G. Now, k(H) is the set of all the edges e of H 
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such that the inverse of the congestion incurred on e in this identity embedding is within a factor 
of at most two of j(H). 

The tool we will use to obtain an (O(logre), %[0(^p-^)])-decomposition of G is the following 
theorem that was implicitly proved by Racke in [36] by adapting an algorithm of Young [15] . The 
proof of the following theorem can be found in Appendix IB.2I 

Theorem 5.2. If there is an a > him and a family of graphs Q such that for any length function 
I on G we can find in 0{m) time a subgraph Hi = (V, Eh^uh^ of G that belongs to Q and: 

(i) l(H t ) < al(G), 

(ii) G is embeddable into Hi, 

(Hi) \<m>^r 

then a t-sparse (2a, Q)- decomposition of G can be computed in 0(tm) time. 

Note that in the theorem we insist that the graph Hi is a subgraph of G. This, together with 
our requirement that G is embeddable into H, implies in particular that 7(-ffj) < 1. 

At the high level, the way the decomposition is obtained in the above theorem is by iterative 
packing into G of a "f(Hi) fraction of the graph Hi with the length functions I changing from 
iteration to iteration. In this process, the length function models the congestion incurred on edges 
of G by the subgraphs packed so far, i.e. the length 1(e) of an edge e will grow exponentially with 
e's hitherto congestion. 

Recall that, by definition of 'y(Hi), we know that the packing of the j(Hi) fraction of the graph 
Hi alone will not overflow the capacity of any edge of G. Furthermore, by insisting on each Hi 
having relatively small volume we make sure that the whole packing does not congest any edge of 
G by a factor greater than a on average. 

Finally, by lower-bounding of the cardinality of k(H{) we make sure that whenever we pack 
some j(Hi) fraction of Hi we make sufficient progress - the congestion of at least \k(Hi)\ of edges 
of G increases by at least 1/2. This allows us to bound the sparsity of the final decomposition, 
since in the end no edge has congestion bigger than 2a. 

In the light of the above, we proceed to presenting an efficient construction that for any 
length function I produces a graph Hi € ?^[0( ml ° g ^ )] that satisfies the requirements of Theo- 
rem [52J Clearly, such a construction will immediately yield our desired (0(logn),H[0(^f^)})- 
decomposition of G. 



Construction of the Graph Hi 

Let us fix the length function I throughout this section. To explain our construction of the graph 
Hi that satisfies the requirements of the Theorem 15.21 we first introduce the notion of low-average- 
stretch spanning trees - as we will see shortly such spanning trees will be the base of our construction 
of the graph flj. 

To this end, let V be some length function on a graph G' = (V',E') and let us define d l G , to be 
the shortest-path metric in G' with respect to this length function. Similarly, for a given spanning 
tree T' of G 1 , let d l T , be the shortest-path metric in T' with respect to I'. Now, for an edge e = (v, w) 
of G' we define the stretch stretch^/ (e) of e as 

stretch^, (e) := '-. 

d l G ,(v,w) 
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The following theorem was proved by Abraham, Bartal, and Neiman [T] (see also [3] and [22] 
for previous results on this topic and [23j for a related result). 

Theorem 5.3 ([I]). There is an algorithm working in 0{\E'\) time that for any length func- 
tion V on a graph G' = (V, E') generates a spanning tree T' of G' such that the average stretch 
TW] T^eeE' stretch^, (e) of edges in T' is 5(log \V'\). 

The following lemma uses the above low-average-stretch tree construction to obtain a spanning 
tree 7} of G that - after we impose capacities u Tl on it - has its volume l(T{) relatively small. 

Lemma 5.4. We can find in 0(m) time a spanning tree T\ = (V,Et l ) of G such that if we impose 
capacities u Tl on the edges of T\ then l(T\) < 2al(G), for some a being O(logn). 

Proof: Consider some spanning tree T = (V, Et) of G with capacities u T on its edges. 
Note that 

1{T) = £ l(f)u T (f) = £ /(/) < e ) = E d l T (v,w)u(e). 

f&E T f£E T eg£,/Gpath T (e) e=(v,w)€E 

Now, since d l G (e) < 1(e) for any edge e, we have 

l(T) = d l Ti (v,w)u{e) < J2stretch l T (e)l(e)u(e). 

e=(v,w)eE eeE 

Thus we see that to establish the lemma it is sufficient to find in 0(m) time a tree T such that 

^stretch( r (e)/(e)n(e) < 2al{G). 

e 

To this end, let us follow a technique used in [3] and define a multigraph G on vertex set V that 
contains r(e) := 1 + [ ^^fr^ J copies of each edge e G E. Note that the total number of edges of 
G - when counting multiple copies separately - is 

e e ^ ' 

Now, we use the algorithm^*! from Theorem 15.31 on G with length function I and obtain in 
0(X/ e r ( e )) = 0{m) time a spanning tree 7} of G - that also must be a spanning tree of G — such 
that: 

J2 e stretch^, (e)r(e) 

— < a, 



Ee r ( e ) 



for an a being O(logra). 

But, by the fact that for any e 



l(e)u(e)\E\ l(e)u(e)E e >r(e') 
{> ~ 1(G) ~ 21(G) 



10 Technically, the algorithm of [T] is designed for simple graphs, but it is straight-forward to adapt it to work on 
multigraphs with a running time being nearly-linear in the total number of edges. 
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we get 



E e stretch^ .(e)l(e)u(e) £ e stretch^, (e)r(e) 
■ < ■ < a. 

21(G) ~ £ e ,r(e') 

This means that 

l(Ti) < ^2 stretch^ (e)l(e)u(e) < 2al(G) 

e 

with a = O(logra), as desired. 

The crucial fact to note at this point is that the definition of u Tl ensures that G is embeddable 
into 7/ and thus the tree 7} is guaranteed to satisfy all the requirements of Theorem 15,21 - with a 
equal to 2a - except condition (|m|) . To illustrate our way of alleviating with this shortcoming of 
7], consider a hypothetical situation in which k(7}) = {e} for some edge e and for all the other 
edges / 7^ e of 7] we have 7t, (/) = 7 for some 7 3> 7(1/) = 7t, (e). Clearly, in this case 7} is a bad 
candidate for the graph 77/ (at least when t is not very large). 

However, the key thing to notice in this situation is that if - instead of 77/ = 7/ — we consider the 
graph Hi = H(Ti,F) with F = {e} then for all the edges / of 7} other than e we have jh { (/) = 7- 
Furthermore, for each edge / G 7?[7/]({e}) it is the case that jHi(f) = 3jj = 1 > 7- This means 
that j(Hi) = 7 and thus |k(77/)| > n — 2, which makes 77/ satisfy condition (jmj) for even very small 
values of t. 

We see, therefore, that in this case by adding only one bottlenecking edge e to F and considering 
the graph 77/ = 77(7], F) - as opposed to 77/ = 7} - we managed to make the size of the set k(Hi) 
really large. It turns out that by utilizing the fact that 1 > 7t ; (c) > 1/mU and thus there is at 
most [log mU~\ different values of jT t (e) such that no two of them is within a factor of two of one 
another, we can always make the above approach work. Namely, as we will see in the following 
lemma, we can get the size of k(77/) to be at least 4 ( 2 "+ 1 )" 1 while fixing in the above manner only 
0(^s£) edges of the tree T h 

Lemma 5.5. We can construct in 0(m) time a graph Hi = 77(7/, 7/), for some 7/ C with 
\Fi\ = 0( ml ° t gU ), that satisfies all the requirement of Theorem \5.S\ with a equal to (2a + 1). 

Proof: First, we prove that no matter what is our choice of the set 7) the graph 77/ = H(Ti,Fi) 
satisfies condition §j§ and (jn]) for a = (2a + 1). To this end, note that 

l(Hi)= Yl l(e)u Tl (e)+ Ke)u(e)<l(Ti) + l(G)<(2a + l)l(G). 

ee(E Tl \F t ) ee.E[Ti](Fi) 

So, the condition ([!]) holds. Furthermore, it is easy to see that G is embeddable into 77/ i.e. that 
the condition (ju]) holds as well. We can just embed each e S E[Ti](Fi) by routing corresponding 
flow along the same edge e in i7/ and each e ^ E[Ti](Fi) is embed by routing u(e) units of flow 
along the path path T( (e) that - by definition of E[Tr](Fi) - is contained entirely in Et { \ 7). 

We proceed to finding the subset 7/ of edges of the tree 7/ that will make Hi = H(Ti,Fi) satisfy 
also the condition (jm}. Let us define for < j < [logmU\ Fj(Ti) to be the set of edges e of 7} 
such that <7T,(e) < f^. 

Note that for any edge e of 7/ -^j < J^*^ < 1 - the first inequality follows since in the worst 
case all m edges will be routed in 7/ over e and the second one since e itself has to be routed in 7/ 
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over e. Therefore, the union U^q" 1 ^ Fj(T{) of all the Fj(T{) partitions the whole set Er l of the 
edges of the tree 7]. 

Now, let us take j* to be the largest j such that 

\ Ff ( Tl )\ < feM^M , 

t 

j'=0 

In other words, j* is the largest j such that we could afford to take as Fi the union of all the 
sets i*b(TJ), . . . ,Fj*(T[) and still have the size of F[ not exceed the desired cardinality bound of 

4(2a + l)m(\\ogmU\ + 1) _~ mlogt/". 

Note that we can assume that j* < [log mU\ . Otherwise, we could just afford to take F[ to be 
the union of all the sets Fj(Ti) i.e. F\ = Et v This would mean that Hi = H(Ti,Et,) is just the 
graph G itself and thus 'y(Hi) = 1 and \k(Hi)\ = m, which would satisfy the condition (Iml) . 

Once we know that j* < [log mU J , the definition of j* implies that 

£t\ 4(2a + l)m(Llogm^J+l) 

2^ wWl > 1 • 

i'=o 

However, since this sum has j* + 2 < [logmt/J + 1 summands, pigeon-hole principle asserts 
existence of < j < j* such that 



> 



4(2a + l)m 



Now, we define Fi to be the union Uj=o Fj(T{). By the fact that j < j* we know that the size of 
such Fi obeys the desired cardinality bound. Furthermore, we claim that if we take Hi = H(Ti,Fi) 
with such choice of Fi then |«(iJ^)| is large enough. 

To see this, note that the fact that 7h ; (c) = 1 for all edges e in E[Ti](Fi) implies that j(Hi) is 

at least ^jj- ■ But this means that all the edges from Fj +1 (Ti) are in k(Hi) and thus 

wflai > \F J+1 (T,)\ > m±»n, 

as desired. 

Now, to conclude the proof, we notice that Lemma 15.41 and Lemma 15.11 imply that the graph 
Hi as above can indeed be constructed in 0(m) time - to find the set Fi we just sort the edges of 
Ti according to 7*7} (e) and choose the appropriate value of j . 

By combining Theorem 15.21 Lemma 15.51 and the fact that - by definition - Hi G U[0( ml ° t gU )], 
we get the following corollary. 

Corollary 5.6. A t-sparse (a' ,%[0(^?S— )])- decomposition of G can be found in 0(tm) time for 
some a' being O(logn). 
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5.3 Obtaining an (0(\ogn),Q v [0( 



m log U 



)])-decomposition of G 



Let us call G' an almost- j -tree if it is a union of a tree and of an arbitrary graph on at most j 
vertices. Now, let us consider a graph H(T, F) for some choice of the spanning tree T of G and 
of a subset F of edges of T. A property of this graph that we will find useful is that the cut-flow 
structure of H(T,F) is similar to a cut-flow structure of an almost-0(|-F|)-tree G(T,F) and this 
G(T, F) can be found efficiently. 

Lemma 5.7. For any spanning tree T = (V, Et) of G and F C Et, we can find in 0(m) time an 
almost-0(\F\)-tree G(T,F) such that the graph H(T,F) is embeddable into G(T,F) and G(T,F) is 
efficiently 3-embeddable into H(T,F). Furthermore, the capacity ratio of G(T, F) is at most mU . 

Proof: Consider some edge e = (v,v') from E[T](F). Let us define v l (e) (resp. v 2 (e)) to be 
the vertex that corresponds to the first (resp. last) moment we encounter an endpoint of an edge 
from F while moving along the path path r (e) from v to v' . Also, let us denote by path^(e) (resp. 
path^(e)) the fragment of path T (e) between v and v 1 (e) (resp. between vf and v 2 (e)). 



Figure 4: a) An example of a graph H(T,F) from Figure El b) The corresponding graph G(T,F) 
with three projected edges e, /, and h. The set Proj(e) consist of edges of H(T,F) marked as 
dashed, the edges of the set Proj(/) are solid, and Proj(/i) contains all the edges that are dotted. 

We define the graph G(T, F) = (V,E',u') as follows. We make the edge set E' of G(T, F) to 
consist of the edges from Et \ F and of an edge / = (w,w') for each w, w' such that (w,w') = 
(v 1 (e) , v 2 (e)) for some edge e G E[T](F). We will call such edge / projected and define Proj(/) 
to be the set of all e G E[T](F) with (v 1 (e) , v 2 (e)) = (w,w'). Now, we set the capacity u'(e) in 
G(T, F) to be: 



a) 





if e G E T \ F; 
u(e') otherwise (i.e. if e is projected). 
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See Figure U] for an example of a graph H(T,F) and the graph G(T,F) corresponding to it. 

To see that this construction of G(T,F) can be performed quickly, note that by Lemma 15. II we 
can find the set E[T](F) and capacity function u T in 0{m) time. Moreover, by employing a simple 
adaptation of the divide-and-conquer approach that was used in Lemma 14.21 to compute u T [D] , we 
can compute all v 1 (e), v 2 (e) for each e G E[T](F) also in 0(m) time. 

By noting that if a vertex v is equal to v 1 (e) or v 2 (e) for some edge e then v has to be an 
endpoint of an edge in F, we conclude that the subgraph of G(T, F) induced by projected edges 
is supported on at most 2\F\ vertices. This means that G(T,F) - as a connected graph being a 
union of a forest Et \ F and a graph on 2\F\ vertices - is an almost-2|i* 1 |-tree. Furthermore, note 
that u T (e) < mmax e ' € E u(e') for each edge e and that the capacity u'(f) of an projected edge / 
is upperbounded by J2e'eE u ( e ') — "imaVeE u(e'). So, the fact that no edge e in G(T,F) has its 
capacity smaller than u(e) implies that the capacity ratio of G(T, F) is at most mU. 

To relate the cut-flow structure of H(T, F) and G(T, F), one can embed H(T, F) into G(T, F) 
by embedding each edge e £ Et \ F of H(T, F) into its counterpart edge in G(T, F) and by 
embedding each edge e € E[T](F) by routing the corresponding flow of u(e) units along the path 
formed from paths path^(e) and path^(e) connected through the projected edge (v 1 (e) , v 2 (e)) . It 
is easy to see that our definition of u' ensures that this embedding does not overflow any capacities 
of edges of G(T,F). 

On the other hand, to 3-embed G(T,F) into H(T,F), we embed each non-projected edge of 
G(T,F) into the same edge in H(T,F). Moreover, we embed each projected edge / by splitting 
the corresponding u'(f) = S e eProj(/) u ( e ) un its of flow into |Proj(/)| parts. Namely, for each 
e 6 Proj(/), we route u(e) units of this flow along the path constituted by paths path T (e), path^(e) 
and the edge e. Once again, it is not hard to convince oneself that such a flow does not overflow 
the capacities of edges of H(T, F) by a factor of more than three. The lemma follows. 

It is easy to see that every j-tree is an almost- j-tree, however the converse does not necessarily 
hold - one can consider an almost-2-tree corresponding to a cycle. Fortunately, the following lemma 
proves that every almost- j-tree G is close - with respect to its cut structure - to some 0(j)-tree G 
and this G can be found efficiently. 

Lemma 5.8. Let G' = (V' , E' ,u') be an almost- j-tree, we can obtain in 0{\E'\) time an 0(j)-tree 
G = (V',E,u) such that G' is embeddable into G and G is 3-embeddable into G' . Furthermore, the 
capacity ratio of G is at most twice the capacity ratio of G' . 

Proof: Let us assume first that G' does not have vertices of degree one - we will deal with this 
assumption later. Let W C V be the set of all vertices of degree two in G 1 . It is easy to see that 
we can find in 0(|I?'|) time a collection of edge-disjoint paths pi, ■ ■ ■ ,Pk covering all vertices in W 
such that in each all the internal vertices are from W and the two endpoints v (j>i) and v 2 (pi) 
of pi are not in W, i.e. they have degree at least three in G' . 

We construct G by first taking the subgraph H' of G' induced by the vertex set V' \ W and 
an empty forest F. Next, for each path pi, we repeat the following. Let e m \ n (pi) be the edge of pi 
that has minimal capacity u'(e) among all the edges e of pi. We add to both G and F the path pi 
with e m \ n {pi) removed . We set the capacities u of these added edges to be equal to their capacity 
in G' increased by the capacity u'(e m i n (pj)) of e m i n (pj). Finally, we add an edge (v 1 (pi), v 2 (pi)) to 
G with capacity equal to ti'(e m i n (pj)). See Figure EJ Clearly, such a construction can be performed 
in 0(|£J'|) time and the capacity ratio of G can be at most twice the capacity ratio of G'. 

We claim that the graph G obtained above is a (3j — 2)-tree with F being its envelope and its 
core being the subgraph of G induced by vertex set V'\W (note that this subgraph consists of H' 
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Figure 5: a) An example of a path pi found in G' . b) Transformed path pi in the resulting graph 
G. 

and all the edges {(v 1 (pi), v 2 (Pi))}i)- One can see that we only need to argue that |V'\ W\ < 3j + 2, 
the rest of the claim follows immediately from the above construction. To this end, note that since 
G' is an almost- j-tree at least \V'\ — j of its vertices is incident only to edges of the underlying 
spanning tree on V'. This means that the total sum of degrees of these vertices in G' is at most 
2(| V'\ — 1). But there is no vertices in G' with degree smaller than two, thus a simple calculation 
shows that at most 2(j — 1) of them can have degree bigger than two. We can conclude therefore 
that \W\ > \V'\ - j - 2(j - 1) and \V \W\ < 3j - 2, as desired. 

Now, to see that G' embeds into G, we note that H' is already contained in G. So, we only 
need to take care of the paths {pi}%. For each path pi, G already contains all its edges except 
e m in(Pi)- But this edge can be embedded into G by just routing the corresponding flow from one of 
its endpoints, along the fragment of path leading to v 1 (pi), then through the edge (v 1 (pi), v 2 (pi)) 
and finally back along the path to the other endpoint. It is easy to verify that our way of setting 
up capacities ensures that this routing will not overflow any of them. 

Similarly, to 3-embed G into G' we first embed H' into G' via identity embedding. Subsequently, 
for each path pi, G' already contains all its edges, so we can use identity embedding for them as 
well. Furthermore, to embed the remaining edges {(v 1 (pi),v 2 (pi))}i we can just route, for each 
i, the corresponding flow from v 1 (pi) to v 2 {pi) along the path p^. Once again, one can convince 
oneself that by definition of e m i n (pj) the resulting embedding does not overflow any capacities in 
G' by a factor of more than three. 

To conclude the proof, it remains to explain how to deal with graph G' having vertices of degree 
one. In this case one may preprocess G' as follows. We start with an empty forest F and as long 
as there is a degree one vertex in G' we remove it - together with the incident edge - from G' and 
we add both the vertex and the edge to F . Once this procedure finishes the resulting graph G' will 
not have vertices of degree one and still will be an almost- j -tree. Thus, we can use our algorithm 
described above and then just add F to the (3j — 2)-tree that the algorithm outputs. It is easy 
to see that the resulting graph will be still a (3j — 2)-tree - with F being part of its envelope - 
having all the desired properties. 



25 



Note that the above lemmas can be combined to produce - in 0(m) time - for a given graph 
H(T,F) an 0(|-F|)-tree that preserves the cut-flow structure of H(T,F) up to a factor of nine. 
This allows us to prove Theorem 13.61 

Proof of Theorem \3.(k By Corollary 15. 6l we can compute in 0(tm) time a i-sparse (a', H[0( ml ° sU )])- 
decomposition {(Xi, H(Ti, Fi))}i of G with a 1 being O(logra). Now, consider a convex combination 
{(Aj, Gi)}i with each Gi being the 0( m ? s )-tree produced by applying Lemma lSTTl and then Lemma 
15.81 to the graph H(Ti,Fi). Clearly, we can obtain this combination in 0(tm) time. 

We claim that this combination is a t-sparse (0(log n), £[0(^M)])-decomposition of G. Ob- 
viously, Yli^i = 1 an d G is embeddable into each Gi since G is embeddable into each H(Ti,Fi) 
and each H(Ti,Fi) is embeddable into corresponding Gi. Similarly, by composing the 9-embedding 
of Gi into H(Ti,Fi) and the identity embedding of H(Ti,Fi) into G (cf. embeddings {fj}j in Defi- 
nition [37T]) , we get that {(Aj,Gj)}j satisfies all the requirements of a i-sparse (9a', Gv[0( m ° s )])- 
decomposition. Furthermore, by Lemma 15.71 and Lemma 15.81 the capacity ratio of each Gi is at 
most 2mU . The theorem follows. 



6 Proof of Theorem 13.7 

The heart of the algorithm that will prove Theorem l3.7l is the procedure Find_Cut-preserving_Trees(G, t, j) 
described in Figure [6j On the high level, this procedure given an input graph G aims at finding a 
j-tree that a-preserves the cuts of G with some probability p, where both a and p depend on the 
parameters t and j. This is done by first sparsifying G (cf. Theorem 1 2. 3 p to reduce the number of 
its edges, then using the decomposition of Theorem l3.6l to represent the resulting graph as a convex 
combination of t simpler graphs (./-trees, for some j' > j). Next, one of these simpler graphs H 
is sampled - in the spirit of Fact 13.41 - and if H is already a j-tree then it is returned. Otherwise, 
the procedure recursively finds a j-tree G that approximates the cuts of the core of H - thus G 
is only defined on the vertex set of this core - and returns a j-tree that approximates the cuts of 
the whole H - and thus of G - by just adding the envelope of H to G . An important detail is 
that whenever we recurse, we increase the value of parameter t by squaring it. Intuitively, we may 
employ this more aggressive setting of the value of t since the size of the problem - i.e. the size of 
the core of H — decreased by a factor of at least t. 

To justify our recursive step we prove the following lemma. 

Lemma 6.1. Let H = (V,E,u) be a j' -tree and let H' = (V',E',u) be its core. Also, let G' = 
(V',E,u) be a j-tree on vertex set V' that a-preserves the cuts of H' with probability p, for some 
a > 1 and p > 0. Then the graph G being a union of G and the envelope F of H is a j-tree on 
vertex set V that a-preserves the cuts of H with probability p. 

Proof: Note that for any cut J/CcV, the capacity u(C) of this cut in H is equal to the capacity 
uh'(C) of this cut with respect to edges of H' plus its capacity uf(C) with respect to edges of 
F. Similarly, the capacity of C in G is equal to its capacity u(C) in G plus the capacity up(C). 
Therefore, the lemma follows by the fact that G a-preserves the cuts of H' with probability p and 
that after adding an envelope F to it G is still a j-tree. 

We proceed to analyzing the running time and the quality of cut preservation offered by the 
j-tree found by the procedure Find_Cut-preserving_Trees(G, t, j). The proof of the following lemma 
can be found in Appendix lC.il 
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Procedure Find_Cut-preserving_Trees(G, t,j): 

Input : Graph G = (V,E,u), parameters t > 1, and j > 1 
Output: A j-tree G on vertex set V 

Obtain a sparsification G of G as in Theorem 12.31 with 8=1 

Find a i'-sparse (0(log <7y[max{-p, l}])-decomposition {{\i,Gi)}i of G as in Theorem 13.61 
where t' is chosen so as to make the decomposition consist of max{^-, l}-trees 
Sample a graph H by choosing G; with probability A^, as in Fact 13.41 
if H is a j-tree then 
I G^H 
else 

Let H' be the core of H and let F be the envelope of H 

Invoke Find_Cut-preserving_Trees(iJ', t 2 , j) to obtain a j-tree G on vertex set V C V - of the 
graph if' 

Add F to G to get a j-tree G on vertex set V 
end 

return G 

Figure 6: Procedure Find_Cut-preserving_Trees(G, t, j) 



Lemma 6.2. For a given graph G = (V, E, u) and j > 1, and t > 2, Procedure Find_Cut-preserving_Trees(G, t, j) 

works in 0(m + tnT 2 log U) time and returns a j-tree G that (log^ 1+ °^ T n) -preserves the cuts of 
G with probability (1/2) T , where T < max{ [log(log t (n/j) + 1)], 1}, n = \V\, m = \E\, and U is the 
capacity ratio of G. Moreover, the capacity ratio of G is n^ +oi - 

Now, proving Theorem 13.71 boils down to running the Find_Cut-preserving_Trees(G, t, j) for the 
right setting of parameters and sufficiently many times, so as to boost the probability that the 
obtained collection of j-trees preserves cuts with high probability 

Proof of Theorem \3. 1\ To obtain the desired collection of n'-trees, we just invoke procedure 

Find_Cut-preserving_Trees(G, t, j) on G (2 fc+1 lnn) times with t := re 2 *- 1 . Note that for this choice 
of parameters we get that in the statement of Lemma I6.2I T < k. 

Therefore, this lemma allows us to conclude that each invocation of the procedure takes 
0(n( 1+1 / fc ) logU) time to execute and thus we obtain (2 fc+1 lnn) n'-trees {Gi}i in total time of 

0(m + 2 k n 1+ ^ log U). 

(We used here the fact that it is sufficient to sparsify G only once for all the procedure calls.) 

Furthermore, Lemma 16.21 implies that the capacity ratio of the obtained trees is at most 
n (2+o(i))kjj anc j gg^h particular n'-tree Gi (\og^ 1+ °^ k n)-preserves the cuts of G with probabil- 
ity (l/2) k . However, it is easy to see that this in turn means that the whole collection {Gi}i 
(l og (i+o(i))fc n ). pi . eservec i t he cuts of G with probability (1 - (1 - (l/2) k ) 2k+1 lnn ) = (1 - 1/n 2 ), as 
desired. The theorem follows. 

I 
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A Appendix to section [4] 



A.l Proof of Lemma 14.61 

Proof: We start by using Theorem 12.31 with 5 = 1 to sparsify the core of our j-tree G. This ensures 
that the resulting j-tree G has a core with 0(j) edges. Also, it is easy to see that we can focus 
on approximating our problems in this graph since any approximation obtained for G leads to an 
approximation for G that is only by at most a constant factor worse. 

For given e > 0, the algorithm of Sherman reduces the task of 0(y / log n/e)-approximation of 
the balanced partition and the sparsest cut problems in the graph G to solving a sequence of n £ 
instances of the following problem^. We are given a graph G being G with a source s and sink t 
added to it. We also add auxiliary edges that connect these two vertices with the original vertices 
corresponding to G. The capacities of the auxiliary edges can be arbitrary - in particular, they can 
be zero which means that the corresponding edge is not present - but we require that the value of 
the s-t cut C = {s} is at most n/2 (thus the throughput of the maximum s-t flow is also bounded 
by this value). The task is to find the maximum s-t flow in the graph G. 

Our goal is to show that we can solve any instance of the above problem in 0(n + j 3 ^ 2 ) time 
- this will imply our desired 0(m + n £ (n + j 3//2 )) total running time of Sherman's 0{\J\ognJe)- 
approximation algorithm and thus yield the proof of the lemma. 

To this end, we design a method of fast compression of the graph G to make it only consist of 
the core of G, the source s and the sink t, and the corresponding auxiliary edges (with modified 
capacities). This method will allow efficient - i.e. linear time - recovering from any maximum flow 
in the compressed graph a maximum flow in the original graph G. Moreover, the throughput of the 
maximum flow in the compressed graph will be at most n/2, as well. So, by running the algorithm 
of Goldberg and Rao [27] on this compressed graph and extending the computed maximum flow 
back, we will obtain a maximum flow in G in time 0(n + j 3 ^ 2 ), as desired. 

Our compression procedure works in steps - each such step is a local transformation that reduces 
the number of vertices and edges of the graph by at least one and can be implemented in constant 
time. In each transformation we consider a vertex v in the current version G' of G such that v has 
exactly one non-auxiliary edge e incident to it and this edge is a part of an envelope of G. As we 
will see, G' will be always a subgraph of the graph G and thus one can convince oneself that - as 
long as G' still contains some edges of the envelope of G - we can always find a vertex v as above. 

Assume first that there is at most one auxiliary edge that is incident to v. (For the sake of 
the argument, let us use here a convention that this single auxiliary edge e' is always present, but 
might have a capacity of zero.) In this case, we just contract the edge e and set the new capacity 
of e' to be the minimum of its previous capacity and the capacity of e. We claim now that given 
any maximum flow /" in the graph G" obtained through this contraction of e, we can extend it in 
constant time to a maximum flow /' in the graph G' . To achieve this, we just transplant the flow 
of |/"(e')| units that /" routes over the edge e' in G", to a flow in G' routed through the edges e 
and e' . Note that by definition of the capacity of e' in G" , the transplanted flow /' is feasible in G' 
and has the same throughput as /" had in G" . Moreover, one can see that the throughput of the 
maximum flow in G' can be at most the throughput of the maximum flow in G" since any minimum 
s-t cut in G" can be easily extended to a minimum s-t cut in G' that has the same capacity. Thus 
/' is indeed a maximum flow in G' . 

1 For our purposes, we use a slight generalization of the problem that is actually considered in |37| . 
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Now, we deal with the case when there are two auxiliary edges e' ,e" incident to v - wlog, let 
us assume that the capacity u" of e" is not bigger than the capacity u' of e'. To this end, we note 
that if we route u" units of flow from s to t along the path consisting of the edges e',e" then there 
still exists a maximum flow in G' such that its flow-path decomposition contains the flow-path 
corresponding to our pushing of u" units of flow over the edges e', e" . This implies that if we reduce 
the capacity of e' and e" by u" - thus reducing the capacity of e" to zero and removing it - and 
find the maximum flow /" in the resulting graph, we can still recover a maximum flow in G' by 
just adding the above-mentioned flow-path to /". But, since in the resulting graph v has at most 
one auxiliary edge incident to it - namely, e' - we can use our transformation from the previous 
case to compress this graph further. This will reduce the number of vertices and edges of G' by 
at least one while still being able to recover from a maximum flow in the compressed graph G" a 
maximum flow in G'. 

By noting that the compression procedure stops only when G' contains no more edges of the 
envelope of G and thus the final compressed graph indeed consists of only (sparsified) core of G, 
source s, and t, the lemma follows. 

I 

A. 2 Proof of Lemma [4.71 

Proof: Consider the case when we are interested in the sparsest cut problem - we will present 
variation for the balanced separator problem shortly. We sort the edges of G non-increasingly 
according to their capacities. Let e\,... ,&\e\ be the resulting ordering. Let r* be the smallest r 
such that the set {e\, . . . , e r } contains a spanning tree T of G. It is easy to see that we can find 
such r* in 0(|E|) time using union-find data structure. 

Now, let us look at the cut C in G that cuts only the edge e r * in T. Since no vertex of G can 
have degree bigger than \ V\, the sparsity of C is at most rt(e r *)|V|. On the other hand, any cut in 
G has to cut at least one edge e r with r < r*. Therefore, we know that the sparsity of the optimal 
cut in G has to be at least u(e r *)/\V\. This implies that C is the desired |y| 2 -approximation of the 
sparsest cut of G. 

In case of balanced separator problem, we define r* to be the smallest r such that the largest 
connected component of the subgraph E r spanned by the edges {e±, . . . , e r } has its size bigger than 
(1 — c)|V|, where c is the desired balance constant of our instance of the problem. Once again, 
one can easily find such r* in 0(|i?|) time by employing union-find data structure. Let F be the 
connected component of E r * containing e r *. Note that by minimality of r* , removing e r * from F 
disconnects it into two connected pieces and at least one of these pieces, say F', has to have at least 
(1 — c)|y|/2 and at most (1 — c)\V\ vertices. Let C be the cut corresponding to F' . Clearly, the 
sparsity of C is at most it(e r *)|y|. Furthermore, any cut C* with min{|C*|, |C*} > c\V\ has to cut 
some edge e r with r > r*. Therefore, the optimal solution can have sparsity at most w(e r *)/|y|. 
This implies that C is a c'-balanced separator with c' = min{(l — c)/2, c} and thus constitutes our 
desired | V| 2 -(pseudo-)approximation for our instance of the balanced separator problem. 
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B Appendix to section [5] 



B.l Proof of Lemma 15.11 

Proof: First, we show how to construct the set E[T](F) in 0(m). We do this by first associating 
with each vertex v a label indicating the connected component of the forest T \ F to which v 
belongs. This can be easily done in 0(n) time. Next, we start with E' := and for each e edge 
of G we check whether its both endpoints belong to the same connected component. If this is not 
the case, we add e to E' . Clearly, this procedure takes 0{m) time and the resulting set E' is equal 
to E[T](F). Now, we obtain H(T,F) by just taking the union of the edges from E[T](F) - with 
capacities given by the capacity function u - and of the forest T\F — with capacities given by 
the capacities given by capacity function u T that, as we will see shortly, can be computed in 0(m) 
time. 

To compute the capacity function u T , we just note that for any e G Et, u T (e) = u T [G](e), where 
u T [G](e) is defined in section I4TT1 Thus we can use the algorithm from Lemma 14.21 to compute all 
the u T (e) in 0(m) time. 

B . 2 Proof of Theorem [Q 

Proof: Let {G{\i be an enumeration of all the graphs from Q into which G is embeddable. Also, 
let us introduce a coefficient \ > for each such Gi - despite the fact that the number N of 
such GiS can be very big, we will end up having only small number of A,'s non-zero. For a given 
graph Gi = (V,Ei,Ui) and an edge e G E, let us define rloadc^e) := = , where we use 

a convention that 'Uj(e) = if e ^ Ei (recall that we require Gi to be a subgraph of G and thus 
Ei C E). 

Following the approach from [36], let M be an \E\ x N matrix with M e ^ := rloadc^e)- Clearly, 
if we consider a vector A = (Ai, . . . , Ajv) then the inequality M • A < for some j3 > 0, ensures 
that in the convex combination {(Aj,Gj)}j of graphs Gi we will have ^ A^rload^ (e) < j3 for each 
edge e G E. This in turn means that ^2i^iUi(e) < f3u(e). So, if we consider the following set of 
inequalities 

Imax(MA) < 
E A * ^ 1 

i 

ViA, > 

where lmax(x) := In ^2 eeE exp(x e ) > max e ^E x e, then a convex combination {(Aj,Gj)}j corre- 
sponding to any A satisfying them will constitute a (^-decomposition of G with the embeddings 
/i's being just the identity embeddings of GVs into G. Also, note that lmax(x) > lnm for any 
non-negative x, thus (5 has to be always at least lnm. 

The analysis from [36] shows that if for any length function / we are always able to find a graph 
GWn such that l(G^) < al(G) then a vector A can be obtained that satisfies the above set of 
inequalities with (3 equal to a + logm < 2a@ The vector A is found as follows. We start with 
A = 0. Then, as long as J2i Aj < 1, we define a length function I on G by setting 

12 To understand how this statement follows from the analysis presented in [36], one should note that when we 
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1(e) := ^((MX) e ) ^ (2) 

u ( e ) Ee'ei? exp((MA) e /) 

for each e € E, and find the corresponding graph G^ny Next, we increase the value of X^ by 
min{7(Gj(;)), 1 — Yli Xi} an d repeat the procedure if ^ Xi is still smaller than one. 

We see therefore that if we use the above algorithm and each time, for the given length function 
I, we supply G^m corresponding to Hi from the assumptions of the theorem - note that Hi is in Q 
and G is embeddable into Hi, so Hi is enumerated among the graphs {Gi}i - then we will obtain 
a vector A that corresponds to a (2a, (^-decomposition of G. 

To bound the time taken by this algorithm, we need to bound the number of iterations - by 
the assumptions of the theorem, each iteration can be implemented in 0(m) time. Also, note that 
the number of iterations determines the sparsity of the obtained decomposition. 

To this end, let us introduce a potential function </>(A) := E eG £ Yli Xif\oadc t (e) that was used 
in [36]. Note that: initially </>(A) = 0(0) = 0; the potential is only increasing throughout the 
algorithm; and at the end 

4>(X) = ^2 ^2 Airload Gi (e) < 2am, 

e<=E i 

since J2i ^ifloadG^e) < Imax(MA) < 2a for every edge e. 

However, by definition of Hi, whenever we increase by j(Hi) cft(X) increases by at least 
\k(Hi)\/2, since for each e G k(Hi), 

TWJrtad^M = M > I. 

Therefore, by property ([m]) of Hi, we have at most t iterations and the theorem follows. 



C Appendix to section [6] 
C.l Proof of Lemma 16.21 

Proof: Let us define ni to be an upper bound on the number of the vertices of the graph that 
we are dealing with at the beginning of the Z-th recursive call of the procedure. Also, let t\ be the 
value of t and let U\ be the upper bound on the capacity ratio of this graph at the same moment. 
Clearly, no = n, to = t, and Uq = U. By solving simple recurrence we see that 

t, = t 2 ' (3) 

and thus 

"hi < ? < ^rr- (4) 



tl t 



consider the length function defined in ((2} then - using the terminology of [36] - 1(d) = ^ e rloadoi (e)partial^(M A) 
partial, (A) and 1(G) = 1. 
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Now, by Theorem 12.31 and Theorem 13.61 we know that Ui+i is 0(nfUi). Moreover, by the fact 
that we sparsify G first and by Theorem 13.61 we know that in Z-th iteration it is sufficient to take 
t' = t[ii, where ij is log°^ nlogUi. 

Note that every graph on at most j vertices is a j-tree thus the procedure stops issuing further 
recursive calls once the number of vertices of G is at most j. So, the number of recursive calls of 
our procedure is at most T — 1, where 

T := max{riog(log t (n/j) + 1)1,1}. 

As a result, the bound on the capacity ratio of G is Ut < n^ 2+ °^ T U , as desired. 

To bound the total running time of the procedure, consider its modification in which we ad- 
ditionally sparsify the graph H' before passing it to the recursive call. Clearly, running time of 
such modification can only increase. Note that in this case all but the topmost invocation of the 
procedure deals from the beginning till the end with sparsified version of the graph. Therefore, 
the time need to execute Z-th recursive call, for I > 1, is 0(tibin(). By equations (|3|) and (j3J), this 
implies that the total time taken by the execution of our procedure is: 



O(m) + Oihtim) < 0{m + U 2 »°i ' ) ^ °( m + tn Yl log U fi = °( m + tnT2 log C7 )' 

1=0 1=0 * 1=0 

where 0(m) is the cost of the initial sparsification of G. 

To establish the desired bound on the quality of the preservation of cuts of G by G, we prove 
that for all Z > 0, if the execution of the procedure made I recursive calls then the graph G 
(Iog( 1+O ( 1 ^' +1 ) n)-preserves the cuts of G with probability (l/2)' +1 . Clearly, this claim implies the 
desired bounds by combining it with the fact that I < T — 1. 

We prove the claim inductively. For I = it follows from Theorem 12.31 Theorem 13.61 and Fact 
13.41 Now, assume that the claim holds for I > and we will prove it for I + 1. By Theorem I2.3[ 
Theorem l3.61 and Fact 13. 4( we know that H is (log 1+o( - 1 ^ n)-preserving the cuts of G with probability 
1/2. So, by our inductive assumption we know that G (\og^ 1+ °^^ l+1 ^ n)-preserves the cuts of 
H' with probability (l/2) m . Therefore, by Lemma Owe know that G also (l g( 1+0 ( 1 ))( z+1 ) n)- 
preserves the cuts of H with probability (l/2) i+1 which - by the above-mention relation of the 
cut-structure of H to the cut-structure of G — implies that G (log( 1+0 ( 1 ))^ +2 -' n)-preserves the cuts 
of H with probability (l/2)' +2 . So, our claim holds for I + 1 and thus for all the values of I. This 
concludes the proof of the lemma. 
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